Abstract

When integrated within a holistic literacy program, phonics applications can be used in classrooms to facilitate 
students’ self-directed learning of letter-sound knowledge; but are they designed to allow for such a purpose? 
With most phonics software applications making heavy use of image cues, this project has more specifically 
investigated whether the design of the images used in such applications may impact on the effectiveness of their 
self-directed use in classrooms. Using a quasi-experimental study, we compared two types of pictorial 
mnemonics used in tablet applications, along with teacher-led activity in three first-year classrooms from the one 
school. The difference between teacher-led activity and integrated picture cues was significant, with teacher-led 
activity proving more effective. The difference between teacher-led activity and form-taking picture cues, 
however, was not statistically significant. Given that the outcomes of this small-scale study suggest that image 
design may be a significant design feature contributing to the educational value of using phonics applications in 
the classroom, we recommend that the design features of phonics software applications attract further research. 

1. Introduction 

Increased accessibility to digital technologies in schools has meant that teachers have been provided with 
alternatives to teaching that have paved the way for possible changes to their pedagogical practices (Hedberg, 
2011). In this study, we explore how digital technologies can be used in Foundation classrooms (the first year of 
schooling in Australia), to facilitate self-directed learning of lower-order, “closed” knowledge; a practice that has 
the potential to free teachers to devote more of their time engaging students in more meaningful teaching and 
learning activities. This paper responds to a move towards a “value added approach” to the research of digital 
technologies in education (“which features [of the software program and/or device] lead to learning?”) rather 
than a question of “effectiveness” per se (Clark & Mayer, 2011, p. 380). Specifically, our purpose was to 
investigate whether the type of pictorial representations used in phonics applications may be a significant design 
feature in software designed to support young learners to match letters to sounds, and whether larger-scale, more 
rigorous research on their design would be warranted. 

Learning the most common letter-sound association (Note 1) is considered a constrained skill (Pfost, Hattie, 
Dorfler, & Artelt, 2014) given the closed, correct-or-incorrect nature of this knowledge. Yet this knowledge has 
been found to be critical in assisting children to develop early reading skills (Australian Government: 
Department of Education, 2005; National Early Literacy Panel, 2008; Stanovich, 1986). Traditional drill and 
practice has been found to be an effective avenue through which to acquire and retain closed knowledge in 
memory (Joseph, Eveleigh, Konrad, Neef, & Volpe, 2012; Kirriemuir & McFarlane, 2006; Musti-Rao, Lo, & 
Plati, 2014; Yu, Liu, & Chan, 2005), yet it is questionable whether this is an effective use of teacher time when 
such a practice may be facilitated via other means (Jones, 2009; Kirriemuir & McFarlane, 2006; Musti-Rao et al., 
2014). Digital technologies provide an alternative means by which to engage learners in self-directed drill and 
practice, working as a tutor by providing users with individualised feedback as they interact with software (Jones, 
2009; Musti-Rao et al., 2014). Use of digital technology in this way, with small groups of children, has the 
potential to release teachers to engage with others in learning experiences that encourage contextualised, deeper 
levels of thinking with or without the aid of digital technologies. 

Many studies have investigated the effectiveness of using computer aided instruction to support young children 
to acquire early reading skills, with the findings presented across studies proving inconclusive (Cassady & Smith, 
2005; Kartal & Terziyan, 2016; Korat & Blau, 2010; Macaruso & Walker, 2008; Nicolson, Fawcett, & Nicolson, 
2000; Paterson, Henry, O’Quin, Ceprano, & Blue, 2003). These studies explored the development of more 
open-ended reading skills such as comprehension and fluency, some considered variations in the strategies 
teachers used, while others included investigations into whether time on task would have an influence on a 
computer program’s effectiveness. Our study contributes to this body of research and seeks more conclusive 
findings by focusing on one design feature (pictorial cues) and a closed reading skill (common letter-sound 
mapping). The outcomes of our study have led us to conclude that when teaching approaches and student 
demographics are similar, and when time on task remains constant, there is sufficient evidence to suggest that the 
images that are used to support the learning of common letter-sound associations may have a significant impact 
on student learning, calling for further research into the design of phonics applications. We suggest that this is 
due to a possible misalignment between discourse and design. 

Kress and Van Leeuwen (2001) and Darcy and Auld (2008) describe four domains of practice through which 
multimodal texts such as those provided via electronic platforms make meaning: discourse, design, production 
and distribution. Discourses, as defined by Kress and Van Leeuwen (2001, p. 4) are “socially constructed 
knowledges”. Design transforms those knowledges into “social (inter-) action” (Kress & Van Leeuwen, 2001, p. 
5). Using Kress and Van Leeuwen’s description of the relationship that exists between these two domains, it can 
be argued that it is essential that discourse (here referring to theories of memory retention and the successful 
learning of letter-sound mapping) is actualised through the design of a digital software application, in order to 
achieve maximum educational value. When software design is motivational and appealing, with limited 
consideration for the cognitive processes required to foster learning, software runs the risk of falling within the 
“high behavioural activity/low psychological activity” quadrant of Clark and Mayer’s (2011, p. 370) e-learning 
“activity matrix”, stimulating little of the psychological activity that is required for learning. 

In this paper, we describe the discourses which informed the design of two letter-sound mapping software 
applications we created and trialled in this study (see also: “Acknowledgements”), before detailing the method 
used to compare the applications against teacher-led drill and practice. We conclude that there is sufficient 
evidence to suggest that the alignment of discourse and design is critical to the effective integration of digital 
technologies in early years classrooms when used as self-directed tasks, warranting further research into phonics 
software design. 

1.1 Background 

This section describes the theoretical concepts that informed the design of the two software applications used in 
this study, beginning with our choice of target content. 

1.1.1 Early Reading Skills 

In his seminal paper, Stanovich (1986) identified that a causal link exists between phonological awareness and 
reading acquisition, and more specifically that “conscious access to the phonemic level of the speech stream” 
enables early reading success (Stanovich, 1986, p. 362); a proposition that has been supported in subsequent 
studies (Ehri et al., 2001; Hoien, Lundberg, Stanovich, & Bjaalid, 1995; Hulme et al., 2002; Jongejan, Verhoeven, 
& Siegel, 2007; Muter, Hulme, Snowling, & Taylor, 1998). Stanovich cautioned, however, that phonological 
awareness and the skill of letter-sound mapping need to be acquired early in order to circumvent “a causal chain of 
escalating negative side effects” (1986, p. 364); a theory recently supported by Pfost et al. (2014), who 
conducted a review of longitudinal studies into reading skill acquisition and development. 

Perfetti (1991) suggested that it is computational knowledge (letter-sound mapping) more so than reflective 
knowledge (“conscious manipulation of the components of spoken words” (pp. 38-39)) that is critical in the early 
stages of reading acquisition. This suggestion has also been reflected in a meta-analytic review conducted by 
Melby-Lervag, Lyster, and Hulme (2012) who found that phonemic awareness showed the strongest correlation 
and was a “unique predictor of individual differences in children’s word reading skills” (p. 340) even after 
controlling for the effects of rime awareness and verbal short-term memory. Stanovich (1986) referred to the 
temporal relationship between phonemic awareness and reading acquisition as the “Matthew effect”, describing 
the growing gap in abilities between less advanced readers and their more able peers as they progress through 
their schooling. 

The proposition that delayed letter-sound acquisition may lead to the delayed acquisition of other reading related 
skills, such as whole word recognition and the ability to gain meaning from text has had much support 
(Australian Government: Department of Education, 2005; National Early Literacy Panel, 2008; National 
Reading Panel, 2000; Rowe, 2005). These findings influenced our choice to focus on letter-sound mapping for 
this study. How to assist children to acquire this knowledge as quickly and effectively as possible is an area of 
research we turned to next. 

1.1.2 Drill and Practice 

The learning and recall of basic, “constrained” (Pfost et al., 2014, p. 209) knowledge, such as commonly used 
letter-sound mapping or high frequency sight words can be effectively facilitated via tasks that plan for repeated 
exposure and rehearsal, often referred to as “drill and practice” (Joseph et al., 2012; Musti-Rao et al., 2014; Yu et 
al., 2005). Studies that have found drill and practice to be less effective than other teaching strategies, have 
focused on knowledge that is process based (e.g., addition, or the reading of passages of text) rather than 
constrained knowledge alone (Musti-Rao et al., 2014; Tournaki, 2003). While there is some criticism for drill 
and practice (Booker, 2007), when strategically incorporated into a holistic, multi-faceted curriculum that 
integrates contextualisation, process and knowledge acquisition, drill and practice can facilitate the recall of 
constrained knowledge, which more cognitively demanding processes of thinking can build upon (Booker, 2007; 
Pfost et al., 2014; Seaman, 2011). 

It is recognised that knowledge is personalised, often differing to the knowledge held by others (Cobb, 1994; 
Nicholas & Paatsch, 2014). It can therefore be appreciated why early users of digital technologies in education, 
acknowledging this newfound avenue through which to individualise learning, made use of drill and practice for 
closed “pre-determined learning goals”, offering users instant feedback as they engaged in “self-paced learning” 
(Jones, 2009, n.p). The use of digital technologies is not restricted to tasks that require the individualised 
rehearsal of pre-determined outcomes however. Since the 1970s, beginning with the introduction of 
“computer-mediated communication” there has been an appreciation that digital technologies can also be used to 
facilitate thinking processes that require collaboration, scaffolding and shared experiences (Jones, 2009, n.p). 

Higher order thinking tends to require more teacher input as engagement in open learning requires more 
guidance, problem solving, judgement and contextualised adaptation. It is for this reason that Kirriemuir and 
McFarlane (2006) caution that while: 

...‘drill and practice’ is a proven principle of education and learning..., it is questionable whether such 
activities should occupy a significant part of the school day, where children have access to teachers, 
resources and more demanding and creative learning tasks (p. 25). 

There is now growing appreciation for the fact that the integration of new technologies into the classroom, such 
as the use of digital mobile devices has provided teachers with greater means by which to diversify their teaching 
and provide positive changes to pedagogical practices (Hedberg, 2011). When the rehearsal and recall of 
constrained content knowledge can be facilitated independently of the teacher following some initial instruction, 
this frees the teacher to devote more time to the teaching of more cognitively demanding elements such as those 
listed by Yu et al. (2005) and Seaman (2011) including elaboration, organisation, evaluation, analysis and 
synthesis. When strategically introduced by the teacher as an additional and complementary avenue through 
which to target specific content knowledge (Cheung & Slavin, 2012; Flewitt, Messer, & Kucirkova, 2014), there 
is good argument for using phonics applications in the classroom. 

1.1.3 Digital Technologies in the Classroom 

Handheld digital devices are now common artefacts that are often used both in class and out by primary school 
aged children and many of their younger counterparts (Danby et al., 2013; Given et al., 2014; Walsh, 2011), with 
tablets and hand-held devices identified as the most commonly used devices amongst 3-5 year olds (Given et al., 
2014). Whether or not children have access to digital technologies in their homes, digital devices can provide a 
highly stimulating and motivational environment for young students, with applications designed to support 
children’s literacy development demonstrating positive outcomes for users (Flewitt et al., 2014; McManis & 
Gunnewig, 2012; Reid & Ostashewski, 2011). 

Watlington (2011) and Reid and Ostashewski (2011) highlight the new possibilities readily available through the 
use of tablet devices, when one considers how new functionalities (e.g., taps, swipes and pinch-zooms), coupled 
with a light, robust textbook-size screen have paved the way for the development of thousands of specifically 
designed educational applications. Comparing traditional classroom media to digital media, Stern (2014) and 
Flewitt et al. (2014) found that young learners have a preference for the digital. While the authors noted that 
digital presentation of content can be considered novel by young learners, it was the broader preference for 
“dynamic visual content, a high degree of interactivity, and the presence of multimodal interaction” (Stern, 2014, 
p. 923) that attracted students to tablet devices. Further to this, Neumann (2016), Ihmeideh (2014) and 
Noorhidawati, Ghalebandi, and Hajar (2015) have all presented findings that indicate that the use of mobile 
devices can encourage more than engagement, with their collective research outcomes indicating that mobile 
devices can also allow for cognitive, psychomotor-based and affective learning. 

As highlighted by Cheung and Slavin (2012) however, use of digital technologies in classrooms is most 
successful when partnered with teacher instruction. Teacher attitude is also critical with positive attitudes 
towards the adoption of digital technologies playing a key role (Blackwell, Lauricella, & Wartella, 2014). 
Principals (headmasters) and classroom teachers ultimately determine the adoption of digital technologies and 
associated applications in classrooms, more so than availability on its own (De Grove, Bourgonjon, & Van Looy, 
2012; Flewitt et al., 2014; Reid & Ostashewski, 2011). There is therefore a need to provide teachers with 
complementary professional development in suitable approaches to support children’s literacy and language 
development (Landry, Swank, Anthony, & Assel, 2011). With a multitude of phonics applications available, it is 
essential that teachers are not only attracted to the potential for self-efficacy that such applications provide, but 
can also appreciate how particular applications successfully combine discourse and design (Kress & Van 
Leeuwen, 2001) to support their programs as a whole. 

Stern (2014), Flewitt et al. (2014) and Reid and Ostashewski (2011) suggest that careful consideration must be 
given to both interaction-based elements and content-based material, when designing content for presentation on 
digital devices for young learners. They suggest that the design should support both interactive and independent 
learning experience, that feedback be clear, immediate and rewarding, and that distractors that divert attention 
from the primary content must be minimised. Though the final point included the elimination of links to social 
media and other such distractors, this suggestion has a strong connection with research that has investigated 
theories into the effective retrieval of target memories, and the theory of “interference”. 

1.1.4 Dual Coding Theory 

Digital technologies allow for a variety of modes through which to present target content; and with this, a variety 
of modes through which to introduce potential distractors. When designing the gaming applications to be trialled 
in our study, we needed to make considered decisions that would ensure that our designs were aligned with 
discourses on how children acquire phonemic knowledge, while planning to minimise any foreseeable distractors. 
The alphabet, or any target content for that matter, can be difficult to recall for a number of reasons. According to 
Levin (1993) there are four main factors: the amount to be remembered, unfamiliarity, the complexity of the 
content and abstractness. In the case of the alphabet, it could be argued that all of the above apply. This is not 
surprising given that research into the origins of the alphabet has found that its Hebrew, Greek, German and 
Gaelic creators at least, considered the alphabet a mnemonic device, used to aid and as a substitute for memory 
so that communication was not constrained by the capacity of the human mind or a particular time or place 
(Stecchini, 1961). 

Over the past 30 years, several studies have sought to investigate how the learning of letter-sound 
associations may be effectively facilitated through the strategic use of pictorial mnemonics. The focus on 
pictorial mnemonics appears to have been influenced by supporters of Paivio and Csapo (1973) who postulated 
that the visual mnemonic’s effectiveness might be due to “dual coding theory”. Dual coding theory is based upon 
the assumption that the memory utilises two encoding systems; one is visual and relates to concrete things 
referred to as imagens, the other being verbal and relating to abstract linguistic units referred to as logogens 
(Paivio & Csapo, 1973; Sadoski & Paivio, 2013). It is theorised that these systems can work independently or 
jointly for “encoding, storage, organization and retrieval of stimulus information” (Paivio & Csapo, 1973, p. 
177). 

When learning letter-sound associations, this theory suggests that the abstract letter shape that is used to 
symbolise a unit or units of sound will be more effectively stored, organised and retrieved from memory if it is 
associated with a visual connection (e.g., the symbol “a” and sound /a/ are associated with an image of an apple). 
Attempting to connect an abstract logogen (the symbol of the letter) to another abstract logogen (a unit or units 
of sound) without the additive benefit of an imagen demands that memory rely heavily on only one encoding 
system. This theory assigns the visual memory cue with superior additive properties, positing that if one code 
fails there is another that can be used to retrieve the sought after information (Kuo & Hooper, 2004; Paivio & 
Csapo, 1973). 

1.1.5 Pictorial Mnemonics 

Reviewing the literature on pictorial mnemonics used to facilitate letter-sound mapping, we came to identify 
three distinct pictorial categories used across the studies: 

a) The disassociated: Pictorial mnemonics of recognisable objects in their natural state are used to 
facilitate recall. The letter symbol does not occupy the same space as the image (de Graaff, Bosman, 
Hasselman, & Verhoeven, 2009; Ehri, Deffner, & Wilce, 1984; Manalo, Uesaka, & Sekitani, 2013; 
Shmidman & Ehri, 2010); 

b) The form takers: Pictorial mnemonic cues take on the form of their corresponding letter symbol. As 
with the disassociated cue, the letter symbol does not occupy the same space as the image (Dilorenzo, 
Rody, Bucholz, & Brady, 2011; Gregory, 1984) (Note 2); 

c) The integrated: Pictorial mnemonic cues and their corresponding letter symbols are combined so that 
the image and letter symbol are identifiable while occupying the same space (Agramonte & Belfiore, 
2002; de Graaff et al., 2009; de Graaff, Verhoeven, Bosman, & Hasselman, 2007; Dilorenzo et al., 2011; 
Ehri et al., 1984; Fulk, Lohman, & Belfiore, 1997; Gregory, 1984; Hoogeveen, Smeets, & Lancioni, 
1989; Manalo et al., 2013; Sener & Belfiore, 2005; Shmidman & Ehri, 2010). 

These studies investigated the acquisition of letter-sound knowledge with English first language users, Dutch 
first language users, Turkish students learning the English language, English language users learning the Hebrew 
alphabet and Japanese students learning the English language. It was de Graaff et al. (2007) who first began to 
explore the effectiveness of using artificially embedded pictorial mnemonics within an electronic environment 
with a follow-up study conducted two years later (de Graaff et al., 2009). 

In most of the studies listed above, the pictorial cues used across categories have been chosen to represent nouns 
which begin with the same initial unit of sound (phoneme) as the most common sound represented by their 
corresponding letter symbols (graphemes). For example, though the letter “g” can be representative of the 
phoneme /j/ (e.g., “giraffe”), an image of a pair of “glasses” is used to remind its user that the grapheme “g” 
most commonly represents the /g/ phoneme in the English language (Ehri et al., 1984). 

With the exceptions of Dilorenzo et al. (2011) and Gregory (1984), who made use of a mixture of the second and 
third categories, the above named authors chose the integrated mnemonic as their main focus of inquiry. There 
are varying degrees to which an image and corresponding letter symbol may be integrated, however, pointing to a 
possible mismatch in the way in which “integration” and/or “embedment” has been understood and used by each 
author. This finding prompted us to address this oversight in the literature through a value-added study which 
has investigated whether the design of images used in self-directed phonics software applications must be more 
distinctly integrated with the letter symbol they are to represent in order to minimise potential distractors. 

Figure 1. Integration scale 

Figure 2. Interference [labels: Letter Symbol, Disassociated Cue] 


The scale found in Figure 1, produced by the authors, shows that pictorial mnemonics can vary in the degree to 
which an illustration can take on the form of its corresponding letter symbol. As we move along the scale from 
“form taking” to “disassociated”, the mnemonic cue takes on fewer features of its matching letter symbol so that 
the link between image and letter becomes increasingly artificial. The problem with artificial links is that other 
memories may interfere with recall. For example, in Figure 2 we see that the letter S is similar in shape to the 
garden hose. Individuals who are reminded of this connection may be led to their memory of a hose, which may 
distract and inhibit their ability to arrive at their target memory. This distraction has been referred to as 
“interference” (Danker, Fincham, & Anderson, 2011; Kline, 1921; Van Dyke & Johns, 2012), which has been 
suggested as the main cause of “forgetting” target content (Levy & Wagner, 2013, p. 3). 

1.1.6 Minimising Interference 

For centuries people have attempted to combat interference and to aid recall using two main types of 
organisational mnemonics; the peg-type mnemonic (use of numbers to aid the recall of information) and the 
chain-type mnemonic (the process used to recall a long list of items) (Bellezza, 1981; Levin, 1993). In both 
instances the aim is to unitise information into a connected, meaningful whole in memory so that retrieval is 
more ably facilitated (Bellezza, 1981). The chain-type mnemonic in particular can be thought of as a 
purposefully devised path that leads the learner from one piece of information to the next until one has arrived at 
the desired target, without being led astray by competing memories. 

Designing an image to take on the shape of its corresponding letter symbol is a technique that aligns itself with 
the use of chain-type mnemonics. It has been suggested that some mnemonic devices, such as pegword 
mnemonics, have little or no connection to the concept being learned, resulting in many labelling this type of 
connection “artificial” (Bellezza, 1981). Artificially or “forcibly” integrating the shape of a letter into an image is 
an example of using two disparate concepts, in an attempt to aid recall. Moulding an image to take on the shape 
of a letter, using characteristics which both the image and the letter shape share creates a more concrete 
connection, which theoretically should be more effective at aiding recall while minimising potential distractors. 

1.2 Current Study 

The focus of most studies exploring the use of pictorial cues to facilitate the learning of letter-sound associations 
has centred on establishing whether using an integrated mnemonic device is more effective than not. What these 
studies have not considered is whether mnemonic devices which attempt to make use of concrete links to 
common objects may prove more effective than those which seek to artificially manufacture such links. Five of 
the twelve listed studies compared the integrated mnemonic against the disassociated cue in an effort to establish 
relative effectiveness. Of these, four declared the integrated mnemonic more effective than the disassociated cue 
(Ehri et al., 1984; Manalo et al., 2013; Shmidman & Ehri, 2010) (Note 3), with the fifth presenting with 
inconclusive results (de Graaff et al., 2009). Only two articles included the “form taking” pictorial mnemonic 
(Dilorenzo et al., 2011; Gregory, 1984). In both of those studies, form taking mnemonics were grouped with 
integrated mnemonics, with no reference to or distinction made between categories. We are also presented with 
an intriguing finding in that de Graaff et al. (2007) found that the printed integrated pictorial mnemonic was less 
effective than learning letter symbols in isolation by rote. 

This study, therefore, explored two avenues of enquiry: 

1) to investigate whether the design of the pictorial representations used in phonics applications 
(concrete [form taking] versus artificial [integrated]) may be a significant design feature of mobile 
applications designed for children learning common letter-sound matches in their first year of schooling; 
and 

2) to investigate whether use of the phonics applications under trial were comparable to teacher-led drill 
and practice. 

2. Method 

This study utilised a paper-based pre- and post-test (detailed under “data collection and analysis”) conducted 12 
weeks apart, to determine children’s knowledge of the most common letter-sound associations before and after 
the trial period. The trial period took place over a 10-week period in the second term of a four-term school year. 

2.1 Mobile Application—A to Z Safari (Version 1.0; Nicholas, McKenzie, Spence, & Wells, 2015) 

A to Z Safari was designed to support interactive engagement with the English alphabet via games that ask 
students to match letter symbols to their most commonly associated unit of sound. A to Z Safari was designed for 
5 to 7 year old children for use in the first-year classroom but can be used by any child who is ready to start 
learning letter-sound mapping. The font used in the applications was Victorian Modern Cursive as this was the 
font used in the trial school, as is the case with most government schools in the state of Victoria, Australia. 

The design of A to Z Safari consisted of a welcome screen, two mini-games (match and memory); and an 
administration area where classroom teachers can create a class list and begin or end “assessment sessions” to 
enable the gathering of user statistics. To address our first research question, two versions of the mobile 
application were created and used in two trial first-year classrooms. One classroom used the mobile application 
with form taking pictorial cues (Figure 3). A second classroom used the same mobile application, however the 
pictorial mnemonics had been replaced with integrated pictorial cues (Figure 4). 



Figure 3. A match game using “form taking” picture cues 



Figure 4. A match game using “integrated” picture cues 

2.2 Participants 

The elementary school where the trial took place was located in the state of Victoria, Australia. The school had 
270 students with an Index of Community Socio-Educational Advantage (ICSEA) rating of 958 (below the state 
mean of 1000), indicating that the students and their families were from a lower socio-economic background 
than many of their counterparts across the state. Twenty-two percent of the families enrolled at the school came 
from a Language Background Other Than English (LBOTE), which was representative of the percentage of 
LBOTE families present in each of the three first-year groups/classrooms that took part in the trial. 

Three groups of first-year children (n = 16 in each group), ranging in age from 5-6 and their three teachers 
participated in this research. The groups can be described as: 

a) FTC-SD: Children (n = 8 males; n = 8 females) who used the self-directed mobile application which 
made use of form taking letter-picture cues (Figure 3); 

b) IC-SD: Children (n = 6 males; n = 10 females) who used a modified version of the self-directed 
application used by FTC-SD. The IC-SD version replaced the form taking images with integrated 
letter-picture cues. All other aspects/features of the application remained the same (Figure 4); 

c) TLI: Teacher-led instruction. These children (n = 5 males; n = 11 females) did not use tablet 
devices. 

The FTC-SD class had one male student with a refugee background. He spoke minimal English at the 
commencement of the study but was able to identify the letter at the beginning of his name and its most 
commonly associated sound during the pre-test. All other LBOTE children across all three classes engaged in 
clear English oral communication at the commencement of the study. Each group is described in greater detail 
below. 

2.2.1 The Three Groups 

As detailed in their post-testing interview, the teachers of all three classes met weekly to ensure that they all had 
“the same letter and sight word focus for the week” (TLI-teacher). They would also touch on other letters already 
learnt as part of their teaching program. Each class also watched the same online alphabet song once a 
week. 

We bring our ideas to the team. We then leave with all of those ideas and you can allocate what day 
you’d like to do those activities (TLI-teacher). 

All three classes engaged in “guided reading” sessions where small groups of 3-5 children read teacher chosen 
texts targeted at their instructional reading level, with teacher guidance; and “reading rotations” where children 
engaged in reading and writing related activities at their tables, largely independently or with peer collaboration. 
All three classes had printed alphabet charts on their tables (strips with images and letters). The TLI class used 
disassociated images, while the FTC-SD class used a strip with the same form taking images used in the Figure 
3 phonics application. The IC-SD class had strips with integrated picture cues, as shown in Figure 4. 

2.2.2 Use of Devices 

One classroom used the FTC-SD digital application (Figure 3) while the other used the IC-SD application 
(Figure 4). Both groups had access to four iPads each, which allowed the children to use the tablets during small 
group tasks on a rotational basis, ensuring that each child was scheduled to use the iPad once a week. In both 
classrooms the teachers reported using the digital application in the first week of term as a teacher-led small 
group guided instruction session. The children needed some direction in the second week but following this, the 
children used the devices in a self-directed capacity as part of their “reading rotations”. The teacher of the 
FTC-SD classroom commented that: 

...we always checked with each other before we did anything just to make sure that the other person is 
doing the same. 

2.2.3 Teacher-Led Group 

In this classroom, as detailed by the teacher in the post-testing interview, there was a lot of teacher-led “rote” 
learning: “...a lot of consistency of going over and over... until they get it” (TLI-teacher). As in the other two 
classrooms, the whole class watched an alphabet song on the interactive whiteboard once a week; however, the 
TLI-teacher’s instruction differed from that of the other two groups in the following way: 

At the end all the [disassociated] pictures are up there... I write it [a letter symbol] over the [interactive] 
whiteboard with the picture because at the very end it shows all the pictures from the song ... whole 
class. 

Beyond this, the TLI-teacher would direct the children to an alphabet wall with letters and disassociated image 
cues displayed in the classroom at least once a week: 

When we go over it we do say it three times and I think it just helps to stick in their brain... up here on 
the board [word wall]. So we’d say “Y”, /y/, /y/, /y/ ... “H”, /h/, /h/, /h/, “M”, /m/, /m/, /m/, so they were 
doing it all together... moving away from the song. 

The letter symbols on the word wall were displayed randomly. TLI-teacher indicated that this was to ensure that 
children were not learning the letters in alphabetical order. Each child would also be asked to choose five letters 
that the class had already learnt, telling the teacher and whole class what the letter is, the sound it makes and a 
word that starts with that sound. They were then to choose a letter that hadn’t yet been a focus letter, or to ask a 
friend for help: 

Most kids choose the first letter of their name and so then that’s teaching the other kids. 

In addition to this weekly practice, the teacher indicated that her approach differed from that of the other 
teachers in that she worked with small groups of students who “were still struggling” for five minutes following 
her reading sessions. We have interpreted this to mean that the teacher worked with small groups of children who 
were not learning letter-sound matches as quickly as their peers. This occurred three to five times per week, using 
cards with letters and disassociated images. The children were to identify the name of the letter, the most common 
sound the symbol represents, and words that begin with that sound. 

2.3 Data Collection and Analysis 

Each child in each group was assessed using a paper-based randomised letter-sound recognition test using the 26 
letters of the English alphabet presented in Victorian Modern Cursive font. Children were asked to give the most 
common sound that each letter represents until they had given an answer for each letter or indicated that they did 
not know a particular letter/s yet. Children were assessed at the beginning of term 2 (entry data) and after 10 
weeks, at the end of term 2 (exit data). Student scores were presented as totals (how many letter-sound matches 
did each child identify correctly?). The data in the results have been presented in aggregate (divided into 
quartiles, averages and via independent-samples t-tests), ensuring that the children are non-identifiable. 

Within-group distribution of outcomes was calculated at the beginning of the testing period by dividing each 
group’s outcomes into quartiles. An independent-samples t-test was carried out to identify whether differences 
between classes were of statistical significance before the trial period began. Another independent-samples t-test 
was conducted at the conclusion of the ten week trial to determine statistical significance between groups 
following the trial. 
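
The two summaries described above can be sketched in a few lines of Python. This is an illustrative sketch only: 
the scores below are hypothetical (the raw scores from the study are not reported), the function names are 
illustrative, and two details are assumptions rather than facts from the study, namely that quartiles are computed 
with inclusive interpolation and that the t-test does not assume equal variances (Welch's form). 

```python
import math
import statistics


def five_number_summary(scores):
    """Return min, quartiles, max and mean for one group's total scores."""
    # Assumption: inclusive interpolation; the study does not state its method.
    q1, q2, q3 = statistics.quantiles(scores, n=4, method="inclusive")
    return {
        "min": min(scores),
        "q1": q1,
        "median": q2,
        "q3": q3,
        "max": max(scores),
        "mean": statistics.mean(scores),
    }


def welch_t(a, b):
    """Return (t, df) for an independent-samples t-test.

    Assumption: unequal variances (Welch's test); df uses the
    Welch-Satterthwaite approximation.
    """
    var_a = statistics.variance(a) / len(a)  # squared standard error, group a
    var_b = statistics.variance(b) / len(b)  # squared standard error, group b
    t = (statistics.mean(a) - statistics.mean(b)) / math.sqrt(var_a + var_b)
    df = (var_a + var_b) ** 2 / (
        var_a ** 2 / (len(a) - 1) + var_b ** 2 / (len(b) - 1)
    )
    return t, df


# Hypothetical scores (NOT the study's data): 16 children per group,
# each score a count of correct letter-sound matches out of 26.
group_a = [1, 2, 3, 4, 6, 8, 9, 10, 11, 12, 13, 15, 16, 17, 18, 19]
group_b = [1, 3, 5, 7, 9, 10, 12, 14, 15, 17, 18, 20, 21, 22, 23, 24]

print(five_number_summary(group_a))
print(welch_t(group_a, group_b))
```

With equal group sizes, Welch's t statistic equals the pooled-variance statistic; only the degrees of freedom 
differ, falling between 15 and 30 for two groups of 16. 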

A focus group interview and individual interviews were conducted with teacher participants following the trial 
period to clarify which teaching approaches were similar and which approaches differed between classes 
throughout the trial, providing teachers the opportunity to give feedback about the applications’ design features. 

3. Results 

The following tables present the students’ pre-test and post-test results. 


Table 1. Entry score data: letter-sound knowledge divided into quartiles 

                IC-SD     FTC-SD    TLI
Min             1.00      1.00      1.00
1st quartile    5.00      3.00      3.50
Median          10.50     9.50      10.00
3rd quartile    13.50     15.00     17.00
Max             21.00     19.00     23.00
Average         9.88      9.56      10.19

Table 2. Statistical significance: pre-test letter-sound knowledge 

            Mean      Observations    df    t Stat    p
FTC-SD      9.563     16              29    -0.256    0.8
TLI         10.188    16

            Mean      Observations    df    t Stat    p
IC-SD       9.875     16              29    0.130     0.897
TLI         10.188    16

            Mean      Observations    df    t Stat    p
FTC-SD      9.563     16              30    0.142     0.888
IC-SD       9.875     16

It cannot be assumed that students will enter their first year of schooling without any knowledge of letter-sound 
associations (Nicholas & Paatsch, 2014). Further to this, the collection of entry data and project implementation 
occurred during the second school term, three months into the school year. Given none of the student participants 
included were new to their class, all child participants had been exposed to letter-sound mapping instruction (at 
least one focus letter a week) prior to the trial. This is reflected in Table 1 with a great spread of knowledge 
evident in all three classes at the commencement of the study. In all classes there was at least one child who only 
knew one letter-sound association along with children who knew 19 or more. 

Table 1 shows that a quarter of the children from the FTC-SD group and a quarter from the TLI group knew 1-3 
letter-sound associations. This was slightly lower than the IC-SD group’s first quartile, which shows that a 
quarter of students knew 1-5 letter-sound associations. The TLI group had the highest performing fourth quartile. 
A quarter of the children from this class began the second term knowing 17-23 letter-sound associations. There 
was also a greater spread amongst this group’s third quartile with a quarter of students knowing 10-17 letter 
sounds. The FTC-SD class had the poorest performing fourth quartile with a quarter of children knowing 15-19 
letter-sound associations. 

Though the TLI class had the highest average (10.188), and the FTC-SD group presented with the lowest 
average (9.563), Table 2 shows that the differences in scores across the three classes were not statistically 
significant. Following the 10 week trial, a statistically significant difference was found between two of these 
groups, as shown in Table 3. 


Table 3. Statistical significance: exit letter-sound knowledge 

            Mean      Observations    df    t Stat    p
FTC-SD      19.875    16              26    -1.072    0.294
TLI         21.875    16

            Mean      Observations    df    t Stat    p
IC-SD       17.438    16              27    -2.431    0.022
TLI         21.875    16

            Mean      Observations    df    t Stat    p
FTC-SD      19.875    16              30    -1.134    0.267
IC-SD       17.438    16

Table 3 shows that though the use of the FTC-SD digital application yielded a higher average than the IC-SD 
digital application, with FTC-SD children identifying on average 2 more letter-sound associations than their 
IC-SD peers, no significant difference was found between the two groups: t(30) = -1.134, p = 0.267 (FTC-SD M 
= 19.875; IC-SD M = 17.438). When comparing users of the FTC-SD digital application to the TLI group there 
is a similar gap between group average scores, with students from the TLI group identifying 2 more letter-sound 
associations than their FTC-SD peers: t(26) = -1.072, p = 0.294 (FTC-SD M = 19.875; TLI M = 21.875), 
which again indicates that there was no significant difference between the two groups. 

Comparing the IC-SD group with the TLI group, however, the difference in outcomes was found to be 
statistically significant, with the TLI group identifying on average about 4 more letter-sound associations than 
the IC-SD group: t(27) = -2.431, p = 0.022 (IC-SD M = 17.438; TLI M = 21.875). 
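
The p-values reported above can be cross-checked from the t statistics and degrees of freedom alone. The sketch 
below assumes two-tailed tests (the paper does not state this explicitly) and integrates the density of Student's t 
distribution numerically, using only the Python standard library; the function names are illustrative. 

```python
import math


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_norm = (
        math.lgamma((df + 1) / 2)
        - math.lgamma(df / 2)
        - 0.5 * math.log(df * math.pi)
    )
    return math.exp(log_norm - (df + 1) / 2 * math.log1p(x * x / df))


def two_tailed_p(t_stat, df, steps=2000):
    """Two-tailed p-value via Simpson's rule over [0, |t|]; steps must be even."""
    upper = abs(t_stat)
    if upper == 0:
        return 1.0
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        # Simpson weights: 4 at odd interior points, 2 at even ones.
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central_area = 2 * (h / 3) * total  # probability mass between -|t| and +|t|
    return 1.0 - central_area


# t statistics and degrees of freedom as reported in Table 3.
reported = {
    "FTC-SD vs TLI": (-1.072, 26),
    "IC-SD vs TLI": (-2.431, 27),
    "FTC-SD vs IC-SD": (-1.134, 30),
}
for label, (t_stat, df) in reported.items():
    print(f"{label}: t({df}) = {t_stat}, p = {two_tailed_p(t_stat, df):.3f}")
```

Under the two-tailed assumption, the recomputed values should land close to the p-values reported in Table 3. 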

4. Discussion 

The results from this study suggest that the use of a self-directed digital application, designed to assist children to 
acquire knowledge in letter-sound mapping, can be as effective as teacher-led instruction, when the application 
makes use of more concrete, form taking pictorial cues. The self-directed use of a digital device may not prove to 
be as effective as teacher-led instruction, when the software application makes use of more artificially integrated 
picture cues. 

The findings from this project also suggest that digital devices can be used by young children in classrooms 
independently of their teacher to effectively support the acquisition of constrained knowledge such as 
letter-sound mapping, via drill and practice type gaming applications. FTC-SD-teacher shared that the use of 
tablet devices in her classroom meant that: 

There was a group that was really being focussed and working on their own which doesn’t always 
happen when the teacher is working with one group in Prep [the “Preparatory” classroom—also known 
as “Foundation”], so they were really on task. 

This is reflective of Stern (2014), Flewitt et al. (2014) and Jones (2009) who submit that a benefit of using digital 
devices is that they can offer opportunities for self-directed, independent and individualised learning due to the 
ways in which software can provide automatic feedback within a visually appealing context. 

This is an important finding, suggesting that digital devices may be integrated into the curriculum to allow 
teachers to devote more time to engaging children in tasks that require more open ended, contextualised, higher 
order thinking. There was also a suggestion from FTC-SD-teacher that the use of one-to-one devices has the 
potential to be more effective than whole class drill and practice given that, 

...there are always going to be the ones that just pick it up from listening, pick it up from reading at 
home, pick it up from doing their home cards and things and then there’s the ones that don’t do any of 
that, who don’t tune in on the floor or don’t do any practice at home and then they are the ones that 
really benefit from using something like this [the application]... it really helped the lower fraction stay 
on task a little bit more and learn some sounds. 

With the outcomes of the IC-SD group not proving as effective as teacher-led instruction, this investigation 
highlights that digital devices will not facilitate positive outcomes on their own, suggesting that the design of the 
images used in phonics software applications may be a significant factor contributing to their successful 
integration in classrooms. The IC-SD application differed from the FTC-SD application only in its image cues; all 
other design features and functionalities were identical. This finding provides some support for the suggestion 
that the design of phonics applications is critical if used to support the self-directed acquisition of lower order, 
closed knowledge. Reflective of suggestions made by Kress and Van Leeuwen (2001) and Clark and Mayer 
(2011), when selecting applications for use in classrooms it is essential to consider whether discourses (here 
referring to theories of memory retention, interference and the use of pictorial mnemonics to successfully learn 
letter-sound mapping) have been actualised through the design of the chosen application in order to facilitate 
learning most effectively, along with engagement. 

4.1 Limitations 

Our child participants used the phonics applications under trial for 20 minutes, once a week for 10 weeks, a total 
of approximately 3.3 hours. This is considerably less than the 5 to 18 hours that findings suggest is the duration 
of the most effective phonological instruction, with longer-lasting programs providing more positive, statistically 
significant results (Ehri et al., 2001). Our sample size was also small. 

Time spent using the phonics applications was comparable to time spent whole class on rote learning in the 
teacher-led classroom, but did not include the additional time spent with small groups of children (15-20 minutes 
a week) who needed more support in the teacher-led group. 

The outcomes of this trial study provide support for a longer testing period, a larger sample size and a request 
that teacher-led instruction be limited to the same amount of drill and practice as classrooms using digital 
applications, to investigate whether image design is as critical in phonics applications as we have surmised. 
Given the ethical implications of requesting that teachers limit time spent with students, it is unlikely that a 
more controlled study can be undertaken. However, longer testing periods with larger sample sizes are 
possibilities that merit further investigation. 

4.2 Conclusion 

The results of this study suggest that when comparing the use of a self-directed digital application designed to 
assist young children to acquire knowledge in letter-sound mapping with teacher-led instruction, an application 
that makes use of form taking pictorial cues can be as effective as teacher-led instruction. A digital application 
that makes use of integrated pictorial cues, however, may not prove as effective as teacher-led instruction when 
young children are working as self-directed learners. This suggests that pictorial mnemonics are most helpful for 
children who are beginning to learn letter-sound associations, when there is a more concrete and meaningful 
relationship between the image cue and letter symbol, indicating that image design is a critical design feature 
when creating educational phonics applications. 

This study makes a good case for the use of digital devices in early years classrooms to support and supplement 
holistic classroom practices that plan for and accommodate both lower order and higher order thinking (using 
both print and digital texts) from the first year of schooling. It also cautions educators to make use of their 
professional judgement when choosing the digital applications they wish to integrate into their curriculum. 
Choice needs to be informed by inquiry into whether the application is highly engaging, while also promoting 
the psychological activity required to facilitate learning (Clark & Mayer, 2011). This necessitates consideration 
of whether there is an alignment between discourse and design (Kress & Van Leeuwen, 2001). 

Notes 

1) In this paper we refer to the most “commonly used letter-sounds” for each of the 26 letters of the British English 
alphabet (Gontijo, Gontijo, & Shillcock, 2003), rather than every sound that every letter can represent on its own or 
in combination with others; a starting point used to initiate a child’s awareness of letter-sound mapping. 

2) Gregory’s (1984) article is a descriptive report. No data has been provided to support his discussion. 

3) Ehri et al. (1984) details two separate experiments in the one article. 